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ABSTRACT 


A categorical analysis was made of five languages. The 
sounds of speech were simulated using written text converted 
Via International Phonetic Alphabet (IPA). The sounds of 
speech were identified as members of fricative, nasal, stop, 
or vowel categories. A statistical analysis was performed 
on categorical content of one (at various positions in the 


word), two, and three sound combinations. 


Several attempts to achieve a differentiation scheme were 
made before any success was realized. Two methods for 
developing conditional expectation are compared; Bayes! 
Conditional Probability Rule, and Cook's Prognostic 


Progression. 


Statistical analysis and “loop" tests indicated that 
languages do have unigue patterns and can be differentiated 
on the statistics contained in the first three sounds. 100% 
correct decisions were achieved for as few as five words in 
the loop test. Limited base data negated result significance 


beyond three successive sounds. 
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I. INTRODUCTION 


A. LANGUAGE PROBLEMS, GENERAL 


There has existed for a great many years the desire and 
requirement to differentiate languages both categorically 
and specifically. It is relatively easy to accept that a 
person can mentally detect an unknown foreign language and 
guess (often accurately) its origin. However, this mental 
process has not been successfully duplicated using 


mechanical/felectrical devices is spite of research efforts. 


Language barriers have historically been major oppcnents 
of international well-being. Interpreters bridged many 
gaps, however, the lack of multi-linguists necessitated the 
need for a large number of bi-lingual Specialists. 
Therefore, a pool of interpreters is required to conduct the 
routine business of international relations. Linguist 
selection is Simplified for some physical characteristics, 
however, it would be quite embarrassing to select a Chinese 
translator for a Korean dignitary. Modern voice 
communications eliminate all physical characteristics except 
those representative of the speaker and his language. The 
question then, was whether the sound content itself 
contained sufficient information of a unique quality 
(language dependent) so as to make it distinguishable fron 
all other possibilities. Certainly it is desirable to 


establish communications expediently and correctly. 


Languages are coded utterances, therefore, it seems 
possible to differentiate the languages by recognizing the 
type and quality of code being used. A "guess" as noted 
above has some merit, but lacks the continual precision 


available in a machine (computer). 
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Be. WRITTEN LANGUAGES 


Written communications have been developed based on 
either the interpretation of the coded sound or a picture 
representing the item one desires to convey. Not 
surprisingly there has been a move in the interest of 
internationalism to convert the pictorial forms of sone 
written languages to the Romanized script. Most apparent 
was the recent introduction of "PINYIN" to the Chinese 
language {Ref. 1]. One intent of the conversion is the 
introduction of phonetics to the school systems to establish 
proper pronunciation. The accepted international languages 
are English, French, and Spanish. Since these languages are 
Romanized, the conversion from pictorial to Roman suggests 
that future generations will be less burdened in mastering a 
second language. In the conversion the ideograms are 
phonetically transcribed using the IPA (discussed in section 
eT) 


Phonetics, then, 1s a basis through which sounds are 
transcripted and establishes a common base between the 
spoken and written word. It is the purpose of this paper to 
establish the feasibility of automatically differentiating 
languages based on the average phonemic content over various 
input lengths. The source data used was various phonetic 
transcriptions of the selected test languages. The 
reference data 1s based on transcripted word frequency (for 
usage) lists, some text selections, and conversation samples 
for each of the selected test languages. The reference data 
is used to establish matrices of probability which measure 


the sound types and sequences in languages. 


10 


- | AA 
OS / 
- _ 
7 7 
7 » - 
/ . . 4. : 
a ’ 7 / Ss 
- bed 7 
a : . 
7 i. 
7 7 7 yo 
- i ; - : 7 > : 
. : 
we - ‘ 
. - 
7 i 7 - - ay 

















. q > Pe, 7 
; 
oe oO > ; . i : 
- : ; 7 ~~~ 492 Tee 
7 
— : @ - 7 4 ‘ ae 
: 0 f , Yee - 
1 es Ris) (oea Se 
y 
(© es yy” ° 7 
- - 
> 7 : 
y 
wee ~. : ip _ J: 
2% - on <4 f x " 
: | 7 a oe ne 
- = ore 
a 7 ia eae a) 


O 
7 . (4 ee = : - - ie 7 re eV 






ee on ee 7 7 ; 
7 7 _° 
- 
- nn 7 a a an 7 ee ‘a, 
- S@* wy < a a tz 
- i . ™ os my +493 
an owns - 7 i 
a ns anh a : : > 
ae teow 7 
= a 
a = 
- 





C. SPEECH 


Speech, as stated earlier, is any length of coded 
utterances. An utterance 1s herein defined as the forced 
emission of modulated air from an oral or nasal orifice. 
Coding implies that some means is applied to modify the air 
from its zero. state. The zero state is defined as 
unmodulated inhaled or exhaled air, 1.e., normal breathing 
patterns. The modification or modulation of the air is 
achieved by the mechanisms of speech to produce audible 
sound waves. The mechanics of speech are discussed in some 


detail in Appendices A and B. 


The coding or modulation of air flow is classified into 
the major categories: vowels (V), stops (S), fricatives 
(F), and nasals_ (N). These categories are further 
subclassified into individual sounds which are unigue in 
their generation and in the characteristics of the modulated 
pressure wave output. A more detailed description of these 


categories can be found in section 2 of Appendix A. 


The unigueness of certain sounds has been a detrimental 
aspect to experimenters in the area of synthetic voice 
generation. Conversely, certain categories have striking 
Similiarities which allow simulator redundancy. In the 
vowel category, Dr. Rosenblum {[Ref. 2] demonstrated how one 
vowel sound could be converted directly to another by means 
of selective formant filtering. Other works with specific 


sound categories are discussed in section 3 of Appendix A. 


The pause is a special case of coded output, and is an 
essential element in speech analysis. Potter, Kopp, and 
Kopp { Ref. 3] interpreted long strings of sounds as recorded 
On a spectrogram after they had first recorded the series 
"word-- pause--word--pause" spectrograms and were sure where 


the word boundaries occurred. In speech the pauses provide 
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a starting and ending point for analysis. The time between 


pauses 1s random, as is the actual length of the pause. 


The data between the pauses 1s composed of sounds from 
the major categories, and for this analysis it was assumed 
that the sounds could be identified at least categorically, 
and removed for analysis. The actual conversion of spoken 
speech into a form for analysis was accomplished indirectly 
via phonetic transcriptions which is the subject of part 
Iil. 
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In the 90th meeting of the Acoustic Society of America, 
150 different papers were presented on various research 
efforts being done in the voice/speech field [Ref. 4]. The 
Speakers varied from language teachers, phoneticians, etc., 
to medical doctors, psychologists, physicists, and 
engineers. They represented various universities, and 
private organizations from the United States and abroad. 
There were several papers which looked into differences in 
languages, but were microscopic in nature. They were 
concerned with the specific functions of the system which 
made certain vowel or consonant sounds in one language vary 
from another, i.e., why a palatilized "i" sound varied from 
the non~palatilized "i." More specifically is it only the 
palatilizaticn or are other factors involved. For example 
some looked at sound clusters of vowel-consonant 
combinations {(VCVVCCV), and focused on the fact that 
certainly a palatalized "i" is forced by the sound 
preceeding or succeeding the "i" sound. Other studies were 
Slanted toward speaker recognition schemes. Some were done 
in languages other than their own, while others looked at 
recognition criteria in their native languages. These 
latter results in general hinted that language dependent 
factors do exist. Note that when transcriptions are used 
speaker dependence is somewhat removed, i.e., the method of 
speaking (dialect) is retained, however, the properties 
unique to the speaker (fundamental frequency, muscle 


impairment, nose size and shape, etc.) are absent. 
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Bir: PHONETIC TRANSCRIPTIONS 


A. HISTORY 


1. 


1G) 


eneral Background 
Sing symbols as a means of communications goes 





G&S 


back through several civilizations. It 1s almost academic 
to determine whether hieroglyphic styles came prior to, 
concurrently or after vocalized descriptions. The fact is 
that a means to convey thought other than by vocalization 
was required, and developed. The various alphabets’ used 
throughout the world have certainly proven themselves able 
vehicles inside of their boundaries, but simultaneously 
represent an additional hurdle to commonality across 
language boundaries. There are approximately thirty 
different alphabets currently being used throughout the 
world. It was not documented until the ninteenth century 
that the presently used alphabets were deemed to be 
deficient. Potter, Kopp, and Kopp [Ref. 3, p. 1] point out 
that Melvin Bell (father of Alexander Graham Bell) developed 
a set of symbols which specified the pronunciation of words 
so accurately, that anyone familiar with the system could 
repeat the word precisely. Bell's contribution was 
primarily aimed as an aid for the deaf, although it also 
made a definite impact on the fields of language, phonetics, 
and the electronic analysis of speech; eé.g., the sound 
spectrograph was certainly a by-product of the search for a 
visible means to convey what is heard. A brief description 


of the sound spectrograph is given in Appendix A. 


2. Development of the IPA 
Language teachers, in the mid 1800's, recognized 
certain limitations in their ability to teach froper 
pronunciation using traditional alphabets. Many teachers 
circumvented these limitations by developing symbols to 


describe the type of sound required. As communications and 





transportation technology bridged the gaps between teachers 
they began to discover that others were having problems 
similiar to their own, thus informal teacher collectives 
were formed. The International Phonetic Association [ Ref. 
5] attribute their origin largely to the efforts of Paul 
Passy who headed a smalt group of language teachers in 
Prance in 1886. Passy and his colleagues found that the 26 
characters in the Roman alphabet were largely inadequate 
when trying to teach proper pronunciation. Many sounds for 
the same letter or combination of letters were in fact 
generated in the vocal system with distinctly different 
mechanics. Armed with this information, they drew up a new 
alphabet to fulfill their needs. The new alphabet was 
composed entirely of symbols each of which represented a 
unigue sound. Their new alphabet was fullv defined and used 
throughout the text of the first publication of the 
periodical "Dhi Fonetik Titcer [Ref. 5]." Others were drawn 
by this unigue idea and subsequent events resulted in the 
formation of the International Phonetic Association. The 
work of Samuel Bell was done independently of the European 
group, and his symbols actualy resembled hieroglyphic 
techniques, rather than an expanded version of the Roman 
alphabet, which was the objective of Passy. Therefore, 
Bell's alphabet faded with time. The association grew as 
numerous inputs were made from various other countries and 
field-phoneticians. Eventually a collection of phenetic 
transcriptions of languages (English et al) were assembled, 
and the IPA was expanded to include sounds unique to 
previously unavailable language transcriptions. This work 
across language boundaries enabled many languages to be 
taught much more easily both in native and second languages. 
Heretofore, tktanguages could be described by the basic 
parameters as usual using the unigue structures and 
additionally by those sound production technigues unique to 
a specific language. 
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In the IPA every symbol has a distinct sound associated 
with it. There are some 89 different symbols encompassing 
all but the African languages as noted by Pr. Francis 
Cartier [Ref 6}. Only 43 of these svmbolsS appear in 
American English, (depending on the reference used; general 
range is 42-46), which makes its transcription much easier. 
In "The Phonetic Alphabet," Dr. Cartier describes in detail 
the techniques of transcripting the English language, and 
has a small treatment of other languages and non-sense words 
(Ref. 6]. In order that the technique for this research be 
expandable to eventually include all languages a computer 


version was developed for 101 IPA characters. 
B. THE TRANSCRIPTION 


A transcription consists of writing (in symbols) the 
sounds one makes when speaking. This can be accomplished 
quite effectively (after training) in either a native 
language or a language for which adequate knowledge of 
proper pronunciation is known first-hand. If the language 
1s foreign to the transcriptor there is considerable roon 
for error, particularly if sessions cannot be spread so as 
to keep the speaker, interpreter, and transcriptor ina 
fresh state of mind. If a transcriptor is transcripting 
from an unknown language, which appears close to one he is 
vaguely familiar with he will tend to do the transcription 
in the symbols and forms associated with the most familiar 
language until he has a key to _ the language being 
transcribed, then he will retrace and make whatever 
corrections are required to Maintain the quality 
(uniqueness) of that language [Ref. 7]. This could lead to 
an erroneous transcription 1f a key group of words were not 
located, and/or the language had not been the subject of a 
detailed analysis to determine which if any sounds’ were 
definitely unique to that language. Another problem 


presented along these same lines 1s dialect. The simple 
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word "the" can be properly pronounced several ways depending 
upon the geographical area of the world in which the word is 
being used. Therefore, it is possible for the transcriptor 


to write 0i when he really should have written da. 


C. SOURCES AND USES 


The transcriptions used for this paper were obtained 
from a variety of sources. Most of the test text portions 
were taken from "The Principles of the [International 
Phonetic Association, 1949." There is a second group of 
text particularly done in a conversational mode type 
obtained from Micholin Ponder of the Research Department of 
the Defense Language Institute, Monterey, California. 
Additional samples of conversational text was made available 
by Dr. R. Wohlford, and particularly useful English samples 
were taken from "Informal Speech" [Ref. 8}. A sample of 
Spanish was contributed by Dr. Maria Baird, Chairman, 


Spanish Department, DLI, Monterey, Ca. 


There are also several dictionaries which use the IPA 
symbols for their pronunciation guides. One such source for 
American English is "A Pronouncing Dictionary of American 
English," by Kenyon and Kott [Ref. oi. Several foreign 
dictionaries were scanned as well as language textbooks, in 
most cases the IPA was used sparingly, making it extremely 


difficult to collect a satisfactory representation. 


a 





IV. DIFFERENTIATION DEVELOPMENT 

The actual classification of specific sounds has 
historically been achieved first by aural perception, and 
later by electronics devices such as the oscilloscope, the 
Spectrograph, and the computer. All of these devices 
required manual intervention. That is, the sounds are 
processed and recorded so that they can be tangibly observed 
and interpreted for content, classification, etc. Presently 
no real time conversion exists for the 
interpretation/classification domain. Developing a routine 
or algorithm to directly convert spoken speech into some 
form of usable data does not appear infeasible. There are 
currently research projects being conducted in this area. 
Hess [Ref. 10], although not specifically oriented to 
language differentiation developed a recognition model whose 
technigues and results may in fact be very applicable to a 
real time classification converter. For the purposes of 
this research it was assumed that such a device couid be 


produced. 


Although it is not clearly understood how the mind can 
differentiate languages, it seemed that a model could be 
developed to produce this differentiation function, not ona 
"guess" basis but with some reasonable assurance as to which 
language was being used. Here, it was felt that the sound 
content might be the key. The research is based on these 
assumptions and the belief that the IPA represents an 


appropriate sound classification capturing vehicle. 
Ae DATA BASE SELECTION 

The first problem was to determine a satisfactory data 
base from which one could establish the categorical sound 


statistics. Considerable research has been done on 


frequency factors for use of words under such conditions as: 
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Ordinary (informal) conversation, formal speaking (addresses 
etc.), informal writing (letters), and formal writing 
(textbooks, newspapers, periodicals, etc.), and these. lists 
are available in a number of languages. After obtaining 
such lists it became necessary to convert first into the 
Roman alphabet, and then to transcribe from the Romanized 
script into the IPA symbology. As discussed above in the 
transcription section, this did not propose an easy task for 
a non-native novice transcriptor. Therefore, solicitations 
for assistance particularly from DLI, were abundant [{refs. 
11, 12]. The question arose concerning the possibility of 
error introduction when using word lists. The major error 
would seem to appear when the factor of frequency is not 
Mietized within the list itself, i.e., if the frequency of 
the word were 1/10 then one would expect every tenth word in 
the list to be the same. A further exampie is drawn fron 
the Markov process. One such use of the Markov process is 
to establish the frequency of occurrence of letters 
{singularly and in combination) for the given language. 
Then a string is developed using those frequencies {note 
that in English one considers 27 characters to include the 


pause). A typical string would appear as: 
BAMANT Perse. GEL-——-——————— etc. 


The frequency of use is a must in this case, and is 


generally determined by: (for the single letter case) 


Frequency (desired letter) = ——— aSehSE}—— (1) 
etters 


Obviously the string of characters does not make _ sense. 
However, when one looks at the strings generated by the two 
letter combinations [of which there are (27)? or (n)#], they 
may find some words (simple) begin to appear. Using the 


three letter comhination level (n)3 more words begin to 
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appear. The process could of course continue ad infinitun 
and at each step would generate more complex words and 
perhaps even paragraphs. An analysis very similiar to the 
Markov Chain type, just described, was used by Rau ([ Ref. 
13], to develop a process by which short samples of written 
text could be automatically differentiated. Rau looked at 
two languages, and achieved results, ranging from 51% to 
100% accuracy factors which were proportional to the length 
of the "short samples" he used [Ref, 13, p. 21, ff]. In the 
case of word lists, it would appear that by not at least 
using a Markcv process something would be lost. However, in 
the early testing, the single listings of words generated 
data very close to that of various random text and 
conversation inputs. Therefore, it was concluded that a 
Significant error was not introduced py using the most 


common words independent of their frequency factor. 


B. COMPUTER LANGUAGE DECISION 


It was apparent that data processing should be 
accomplished by a computer but the choice of a language 
(compiler) was not immediately obvious. FORTRAN IV was 
attempted, but rejected because of the conversions reguired 
in the input/output (I/0) for the raw data. The language 
SNOBOL IV (used in conjuction with SPITBOL) is probably the 
best language [Ref. 14], however, arrangements for its 
installation could not be achieved in time to be effective 
for this research. ALGOL W contains qualities of both 
FORTRAN and COBOL, and was ultimately selected because of 
its inherent ability to process both text and equations. 
The initial program was developed by M. C. Thomas[Ref 15], 
using strictly English data, and a special alphabet 
conversicn code (ROMI) consisting of the 44 [PA _ sounds 
associated with the English languages. The program 
developed the statistics for the first sound, the first two 


sounds, first three sounds, and the first four sounds of the 


Z0 







a . ae or 7 


nan 


= 


7 oe = 










; | 
= a en oe 
pe ae ee a ee ee tt 
i aon 
1 rt eee acs Pr ae on 


se ae 7 - 


a a aaa i am 












ee 


a ee a 
a a 





{+ 7 
a - a 
eee 
~~ a > 


data input. The statistics were developed in the program by 
reading in the data word by word, then converting the data 
into the sound categories (F,N,S,V), via ROMI, letter by 
letter. The converted words were processed to extract the 
appropriate statistics to produce the data shown in computer 


output 1. 


As languages outside of English were drawn into the data 
base, ROMI had to be increased, and was ultimately expanded 
to its present state of 101 characters. The number of 
symbols (101) decided upon was not entirely the result of 
the different languages introduced or tested, rather the 
number was selected as being the universal set recognized by 
the International Phonetic Association {Ref. 5], plus some 
diphthongs uniquely identified by Cartier and Todaro f[ Ref. 
6 j. Thus KOMI is cavable of converting any language 
subjected to the models. The ROMI to IPA conversion is 


listed by categories in the appendix C tables. 


Five languages were selected at random from among those 
for which frequency tables and translators or previously 
transcripted materials were available. Theoretically the 
languages are independent variables to any algorithym and 


thus were designated A, B, C, D, and E. 


C. FIRST SOUND ANALYSIS 


Results similiar to the first data run are shown in 
computer output 1. The sound pattern of the first letter 
per word (or phrase, in the conversational cases) showed 
sOme promise of a pattern, even though they were bunched 
together at the lower end of the scale (see figure 1). The 
words were separated from the phrases to prove (or disprove) 
Similarity, and similarity did occur, thus disproving that 
an error factor was introduced because of using word lists 


without frequency considerations. 
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A decision table was drawn up using ranges established 
by figures 1-a and 1-b. A new set of data for a_e single 
language was then analyzed and its results compared with the 


table. The test data is indicated by the X's in figure 2. 


The results of the single test indicated that the sample 
size was inadeguate. Additionally, it appeared that the 
ranges would increase proportionately with the sample size, 
unless a convergence could be determined. If the ranges 
were to increase, there would be an even greater overlapping 
of regions, and decisions would be made more difficult. 
Therefore it was concluded that differentiation was not 


feasible on the basis of a single sound. 
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Figure 2. First Sound Decision Table 
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D. NORMALIZATION CROSS-SECTION 


The second approach dealt with observing a cross-section 
of the languages collectively on the basis of categorical 
content. This method of analysis presumed that the sounds 
would be randomly distributed in a "Gaussian" fashion. The 
decision factor selected was the Shannon Theorem [Ref. 16, 
chapter 3}. In the Shannon theorem the data is assumed 
Gaussian, and three situations occur: (1) the curves are 
disjoint and the decision is immediately obvious; (2) the 
curves totally overlap and no decision is possible; or (3) 
there is ae slight overlapping and some criterion can be 
developed which will give reasonable predictions of the case 
at hand, providing the error factor (amount of overlap) is 


not severe. The three situations are: 


£ (x) £ (%) £ (x) 
AST \. ee ea eX 
case (1) case (2) case (3) 


Figure 3. Shannon Theorem Decision Curves 


It was anticipated that either case (1) or (3) would 
emerge. Therefore, the statistics obtained from the 
computer printout were "normalized" or "Gaussianized" in 


accordance with the following: 
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X = the % of occurrence of a category 
in the first three and last two 
sound positions of a word. % => frequency. 


E(x) = ae (2) 
a(x) = | —=-C4_=_£ {x1 12 bail (3) 
mex) = 


fe — ly (Nes eee) » (a- ly), “(). 


In the case of the fricatives for language "A" the 
content (from computer output 1) was: 


X = Random Variable 
; = frequency of 
sound Location occurrence of F 
ss ee es a ae oo Vee Caammee 
2 00.98 
Fs 1: 46 
n-1 15.46 
n 01.35 
The computed mean is: 
foe 33.408 + 0.98 + 17.486 + 15.46 + 1.35 
— 22272 =225-—-- SUID tO * te DL 


135,75 


Similarily the standard deviation was calculated to be: 
o(x) = 10.22 
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Then using table 6.1 [Ref. 17] the Gaussian density (pdf) 
for fricatives was established for language A. Using 
identical procedures the fricative densities were created 
for the remaining four languages and superimposed on the 
graph for language A. The same procedure was followed to 
establish graphs in each category. The tables and graphs 
for this procedure are contained in appendix D. The results 
were clearly not what was expected. In fact, Shannon's case 
(2) 1s approached in each graph. Even in the graphs where 
case (3) was there, in spirit, the error factor is clearly 
greater than the Shannon theorem could accept. It was 
speculated that increasing the reference data would not be 
sufficient to Obtain more dispersion in the graphs. 


Therefore, this technique was 4bandoned. 


E. TWO-SOUND PROBABILITIES 


The computer program which develops the statistics 
creates them such that the frequency information follows the 


tree branch technique, as shown in figure 4. 


The frequencies registered at each point in the tree is 
an exact probability. Therefore at every sound level for 
each language the probability per language is available. 
These probabilities are grouped by language in computer 


Output 1, and are summarized in table D-I, in appendix D. 


/ 


26 





Nx 
v 
Ss Say 
ve ie 
rm 
Fa] 
Na. | nS, $4) = 
- is $3 
= va, Va | ‘es ea 
a | S 
| LS ot 
rai Fe | Fy | Se Va 
Bal $2] 
; Ea, = 
he cs S| Fu 
Sa So A Mae 
We | V3 | 
L- SND 2-34D 3- SND Da - A- SND 
) Cie) 4) @u) 95” 4) 
Figure 4. The Frequency Tree. 
Assuming that all languages are equiprobable 


(P(A) =P (B) =P (C) =P (D) =P(Z)=1/5], Liklihood ratios can he 
developed using Eayes rule; which computes the probability 
of a given language based on the categorical sound occurring 
at a specified position in the word. For example the 
probability of a test word being from language "X" given 
that a sound category was identified in position two (second 


sound) would be computed as follows: 


PO) * err Rie errors © 


where: X = language = A, B, C, D, £E 
Yz2= sound category = F, N, S, V 
1 = position indicator = 1, 2, 3, 4, n-1, n 
j = position indicator = 1, 2, 3, 4%, n-l, non 


These probabilities are tabulated in table IV-I. A decision 
can be achieved using the naximum likelihood. For example, 
from table IV-I, for an "S" in the 2nd position one would 


choose language E because it clearly has the the greatest 
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probability of being the correct choice. In the case where 
more than one word from the same language is being analyzed 
the decision criteria requires an additional iteration; this 
presumes the words have different sound content arrangements 
(order). When the position is selected (say at the second 
sound position) the rows are removed in tact. The removed 
rows are then isolated and a product is calculated for the 
column elements. The correct choice is then the columnar 
product which is largest, i.e., has the greatest likelihood. 
Multiplication corresponds to the assumption that words are 
chosen independently at random from the language. fFor 
example, given a first word has a "V" in the last position 
the maximum likelihood would elect language B, however, if 
the second word had an "S" in the last position and the 
product is taken, language D will have the owaximun 
likelihood. 


a? > <> a ee a ee a Se) cS ee eg) ap ee ees SS eee See ee 


V 074 aon aoe - 126 - 258 

2220 -000 o ae ~491 So 
product -016 -000 084 - 062 039 
sun 294 a3 - 466 0.17 2411 


Using the product method, clearly eliminates any 
language which is devoid of a unigue sound. The advantage 
of the summing method is that languages which have high 
percentage of occurrence are brought out for consideration 
in spite of a categorical void. Arguments can be 
established on which is the better method (sum or product), 


however, it was decided to use the product method in 


subsequent decisions. ine Peeauctsor one On Bmore column 
elements is referred to as "validity factors" in computer 
peogram 2; aepreoduct. OL Validity facters are ~ called 


Pvalidity products.” 
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In spite of the closing statement in section IV-C, that 
differentiation was not feasible on the basis of a _ single 
sound, testing was performed on some 200 randon 
words/phrases. The results (not included) were ambiguous 


and supported the earlier clain. 


Bayes Rule still offered more potential than previous 
efforts, therefore, the two-sound combinations [of which 
there are 16; F, N, S, V => (4)2], were computed in the 


following manner: 


BIE Y 2%, ) = — <P A— (6) 


X=A,E 
where: X = language = A,B,C,D,£E 
Y = sound category = F,N,S,V 


P(X) = P(A) =P (B)=P (C) =P (D) =P (E), 
by assumption 


These values are given in Table IV-II. 


The decision rule is identical to that for one-sound, 
1.e@., maximum likelihood. Given any word(s) greater than 
one sound, a decision can be reached. Again two or more 
words dictate the product (or sum) rule to obtain the 


maximum likelihood. 


The results of limited testing (30 words) using this 
method were ktelow the desirable limit. In the case of one 
random werd, decisions were less than 20% correct; for two 
words less than 50% correct; and three word decisions ranged 
from 45-55% correct. Though the correct decisions were 
proportional to the number of words, the decision procedure 
appeared to be weak. Presuming this was a result of the 
earlier assumption that all languages were equiprobable, a 
procedure similiar to Bayes was constructed and defined as 
follows: 
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wheres X = eee es 
a = F,Ny;SPY 


Again the assumption of equiprobable languages was inserted 
thus removing the [P(X) ]? terms. The remaining equation is 
hereby defined as "Prognostic Progression." The equation 
calculates the "Logical Prognosis" of a language given the 
frequency of occurrence at a position and the probability 
{or Logical Prognosis (L) ] brought forward from the previous 
position in the tree (fig. 4). Table IV-III was calculated 
using this procedure, and is compared with the Bayes 
solutions in Table [IV-IV. The results are interesting. 
They show that the resultants are shifted, from one table to 
the next, and tend to negate the assumption of language 
equiprobability. Thus the probability of language arrivals 
1s introduced, but indirectly, at any selected level except 
the first (single sound). This enables the use of variable 
probability for reference languages, Cre unknown 
reference sample sizes are inherently accounted for. Three 
comparative tests were run for Tables IV-III versus IV-II 
using 100 randomly selected words (20 per language). The 
results showed for the two-sound level that the formulas had 
relatively the same erfect. The overall results achieved 
were: (note, 100 word samples increased minimum correct for 
this test to 40% compared to previous 20$) 


SSD ES SS EE ES ES SE SD SE ee ee a em 


% CORRECT 
Procedure test 1 test 2 test 3 
BAYES 39 4Q 38 
COOKS | 39 4O 


> ap SD ED <p SD ce DD DD ee ee ee ee ees ee SD ms. a De ee 
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Further test at the three-sound level were conducted; 
results very similiar to the two-sound results were 
obtained. The number of correct choices were within 1--4% 
of each other, however, it was noted that in4 of 7 tests 
(avg. words = 35) the magnitude of the correct over the next 
competitor was much greater with Cooks method. 
Additionally, the order of magnitude from the most likely 
(maximum froduct) to the second most likely, second to 
third, etc., was much more even, and smaller in Bayes than 
in Cooks. It was thus assumed that the idea of "Prognostic 
Progression" was at least equivalent to Bayes Rule. This 
assumption encouraged the writing of computer program number 
ane 


The basic operation of computer program 2 is a 
continuation of program 1, except that some extra features 
were added. The primary extra feature 1s variable 
partitioning, wherein words of the same sound length are 
analyzed collectively by language. The partitioning is 
variable in that the statistics can be ended at any point, 
1.e., all words 21, create the same data as in computer 
output 13; words 22, creates data for one sounded words, 
apart from the data for words of two sounds and greater; 
this process is allowed through words 26. The decision 
portion forces a word to follow a flow path based on the 
number of sounds the word contains. By doing this, for 
example, three sounded words are only compared with all 
other three sounded words in the data collection, unless the 
the partitioning is set at 22 or 23. The standard test 
(comparisons) in the flow path for a partitioning of 26, is 
provided in table IV-VII, and the program flow diagram is 
provided in figure 5. Secondary features include selectable 
graphic representations of either the raw statistics, or the 


Prognostic Progression values for any or all reference data. 
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The model was tested using randomly selected words from 
each language within the data base. This "loop" test was 
used to determine if the model could correctly segregate and 


identify what it already held in storage. 


The first-two position results related a noticeable 
Similariity between the languages, and required at least a 
two word sample size to achieve correct decisions in 50% of 
the tests. The results of this test are tabulated in table 
Iv-V. Tests involving the first-three sounds showed even 
greater improvement in the results. This indicated that at 
the three-sound level unique patterns were emerging. The 
test of three-sounds duplicated the procedure used for the 


two-sounds and their results are tabulated in table IV-VI. 


Testing beyond this point was considered insignificant 
due to the limited reference data. Beyond three-sounds, the 
tests were approaching 100% for as few as five word/phrase 
test samples. The consensus was that even though patterns 
were uniguely developing, their percentage relative to the 
whole of a language negated the significance of the 


patterns. 


A final group of tests was run with the program changed 
to use Bayes’ Rule, rather than the Prognostic Progression. 
The results (not included) were for all practical purposes 
identical. The Bayes’ Rule method also forced 100% correct 
decisions as early as five words. On the basis of this test 
it is concluded that Bayes’ Rule and Cooks Prognostic 


Progression perform egually on this type of task. 
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Table IV-I. 


Bayes’ Theorem Calculations for one-sound 








Y; P(AIY,)  P(BIY,) P(CIY,) P(DIY,) PCEIY,) 
r . 237 204 . 142 .179 ~235 
N, .179 .181 223 . 260 2157 
S, 281 167 .185 140 208 
v; 084 250 . 266 252 -148 
ie 035 1617 374 .169 261 
No HO710 350 234 .148 -200 
So a 029 .185 269 405 
V5 S270 .173 . 169 . 236 2153 
f, . 230 .129 . 228 .254 -162 
N, . 284 . 160 .197 5 2157 
S, .078 300 208 S150 383 
V5 .174 .215 ser a2 a2 
Fy 041 . 186 ROT 302 .253 
N, . 146 . 263 .213 096 . 28 3 
S, .499 083 . 131 .140 .148 
V4 . 130 223 228 .252 -163 
Ee BO22 396 .151 2274 .216 
Nay .144 .169 . 296 2154 on 
oa 051 . 328 . 162 .179 .280 
a 358 .098 .178 233 2134 
P, .293 .138 - 067 297 .205 
Nn . 396 102 . 226 2194 096 
; . 220 000 . 137 2491 .153 
, 074 .313 B229 .126 .258 


ee ee ee ee ee ee Ge? EP ae GD Ge a ee ae GG ee SS 
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Lape. I= 15. 


Bayes Theorem Calculations 
for two-sounds 


= i ee ee ee ee eee 





Sf Se ee SEE ESS SE ee oe) eS eee 





a, PaGalie;, Fi) PETE, HD 6 6P CCI FY) POOLE, &) PCE, Y;) 
F, F, 000 .000 - 000 000 10 

F, No - 037 - 31818 - 154 -068 -353 

wv, 303 -199 . 188 -190 ; 920 

N, Fe - 000 - 000 - 900 -000 -000 

N, No .000 - 161 - 400 .000 .438 

Ny S2 - 000 -900 1.9 -900 -900 

N, V2 . 200 - 163 .179 .328 20 


SD SD ae GD UD DP Se ee ee ee Se eee SE SS a Ee ce one ae ae ee 


S Fo .192 -000 . 000 .351 457 
S, No 062 ~324 243 .151 22m 
S; So . 000 -000 . 000 000 1.0 

S1 V5 . 298 -192 .154 141 ~ S26 
YP. . 000 -334 443 425 .098 
V; No aio 126 . 285 246 .075 
Vs So 044 -052 - 403 08 1 .420 
V1 V> .188 .109 . 097 .570 1007 
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Table IV-III. 


CookS Prognostic Progression Calculations 
for two-sounds 








Y, Yj Tae i) Cems Y ) L(Ciy, % ) L(DIY, Yj) LCEty, ¥,) 
rar 000 .000 .000 .000 1.0 
F, Np 045 .359 2114 “OG 2421 
F, Se .293 .000 - 000 .108 .599 
Fy. . 363 .184 . 138 2172 .142 
N, F, .000 .000 000 .000 .000 
N, No .000 .144 483 000 a2 
N, So .000 000 1.0 .000 .000 
N,V, .171 . 130 woe 409 .098 
S, Fp 253 .000 000 231 .516 
S) No .085 .334 219 .103 .260 
ous . 000 000 000 .000 1.0 
Sve . 373 tan . 126 .088 232 
var; 000 .325 486 mB0 060 
V; No 045 .290 339 S276 .050 
V, So .018 .059 2521 .099 .302 
We V> .073 .118 2119 664 .026 
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Table IV-IV. 


Shifting Effects Denoted 
by the Difference in (V-III) - (V-ITI) 





— WEGin=-aina PS WEEP => ol = qe ee RG a Sr SE SS eee 





A B C D E 
Fe - 000 900 - 000 -000 -000 
PN 008 Wese)  s08o(-) 006 (-) 068 
FS 011 000 000 .029(-) .018 
FV - 060 0H 5 (-) - 0500) ~-O1ae—) «Oz 
NF - 000 -000 - 000 -000 -000 
NN - 000 -017(-) - 083 - 000 - 066 (-) 
NS - 000 -000 - 000 -000 -000 
NY nopey=)  .0seKe) <S0ne 081 .032(-) 
SF 061 000 000 s126e-) sds9 
SN 023 010 028 (he. OMB Re 089 
S5 - 000 -000 - 000 -000 000 
VF 000 -009(r) 043 005 038 (-) 
vN .074(=) 0.14 054 030 .025(-) 
vs Rozen) 6|OCw O07 .118 018 .118(-) 
vv .115(-) 009 022 094 017 


> ee SD SP ee SS aS a ee a SS SS SE Ee <a a Se 
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Table IV-V. 


Two-Sound Results. 
using Prognostic Progression 


— = SE eee a eee a Se eG a ee ee i ee SS SS 


smpl 
Lng size A B C D E 
A 33 - 6x3 7 eae -000 ou -061 
B 25 -240 - 400 - 08 - 120 -016 
c 4g - 367 - 204 - 163 «265 - 000 
D 42 - 286 - 143 -000 oa - 000 
E 60 «Sue? -050 - 150 -150 - 8 


an Ee ee ee ee ee ee eee ee ee mc Sea ia a Ge 


Sapl 
Lng size A B € D E 
A 16 2563 sali25 -000 «oa - 000 
B eZ -083 - 583 -083 -083 = liga 
c 24 = Tow eZ -208 = 20 -000 
D 21 «238 - 095 -000 -667 - 000 
E 30 2133 - 067 s 35 slo? - 500 


> ep ab a ae SP ee ee ee ee ee ee cs ee ee ee Se ee om a ES aS ee a es 


Average correct single word = 83/209 = .397 
Average correct two words = 50/90 = .556 
Sample sizes of 15 and greater gave results of 100% correct 


selection in 4 of 5 tests. 
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Table IV-VI. 


_ Three-Sound Results | 
using Prognostic Progression 


ONE-WORD TEST: 


72 ee ee ee eee Ee 





SE eee Ee ee ee eee, oe ee ee eee a Sm me ee SSS SI SS een Se 


sapl 
Lng size A B C D E 
A a3 -667 2033 -100 wea -067 
B 22 - 160 - 480 oo'0 - 160 040 
C 49 -306 941 a7 2265 -061 
D 42 ~214 -071 -048 «2195 -071 
E 60 2A] ~ 083 ai33 «167 - 400 


A ED EY ED ED ED ED ED ED EE ED EE ED ED SD Ee ce EE cD ED ee ee ee ee ce ee 


Se ee ee ee eee 


smpli 
Lng size A B C D E 
A 16 -688 -000 -900 so - 000 
B 2 -083 - 580 AoW -167 - 900 
S 24 slo - 042 -500 292 - 000 
D 21 095 -048 -048 ~on0 - 000 
E 30 - 100 - 067 - 100 - 167 -567 


a > ES ee Se ee ae ee ee ee ee ee ee ee a ee ae a eee Se se ee ee eee 


Average correct single word = 97/209 = .464 

Average correct two words = 64/103 = .621 

Sample sizes of 11 and greater gave results of 100% correct 
selection in all tests. 
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Table IV-VII. 


Model Test Path 
for 
A Given Word Length 


A EE A a RS GE EE [CSUR See eee me ee 


word 
length Test Criterion 


=> <p ee a ee ee ee ee se ee ee es ee ees ee es ee ee es es es ee ee es ee ee ee es ee ee ee eee ee 


1. P (x) 

2. P(x), L(y, y, 1x) 7 

3. P(x), L(y, y, 1x), L(y, ¥, Y, 1X) 

4. P(X) ve Lily, y, (Xe L(y, ¥, ¥, EY Ly, y. ¥, ¥, 1%) 

3. P(x), L(y, y,'X)- L(y, ¥, ¥,1*)- L(Y, ¥, ¥,¥, 1%). 
L(y, ¥, Y3 Y4 Ys '*) 

6. P(x), L(y, y, 1x)» L(y, ¥, ¥,1%)-+ Ly, yy, ¥,1%)- 


L(Y, Yo ¥3¥4 Ys 1X)» L(Ys¥g1X)+ L(y, Y, Y,Y,1%)- 


> SE EPG GE SE OE GP oe OO GP ee ee Ee eee ee ee 


= language = A,B,C,D,£ 

= sound category = F,N,S,V 
= first sound 

= second sound 

third sound 

= fourth sound 


= last sound = n 


Nm & WW NH =| SS 
il 


= next to last sound = a-~l 
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V. RECOMMENDATIONS FOR FUTURE STUDIES 
V5 The model finally developed has considerable 
potential which was not exhaustively exploited herein. The 
reference data should be vastly expanded, or at least 
altered to reflect the most current language/vocabularies in 
use. 1000-1500 words and/or phrases per language is 
suggested, because only 20% of that range was used here, and 
it appeared that word uniqueness was overshadowing language 


uniqueness. 


Ze The Prognostic Progression formula (no. 7) should be 
tested at greater lengths to identify its unigueness, or 


qualify it as an extension of Bayes Theoren. 


3. The largest problem encountered was in not having a 
spoken speech to phoneme converter. Assuming a Similiar 
lack of availability for future researchers, state is 
recommended that high quality transcriptions be obtained 
earlyon, and the quantity of transcripted words be used as a 
criterion for continuance. Further phrase transcriptions 


will provide better results than word lists. 
4a. Diacritical Marks and emphasis were not considered 


herein, but may prove beneficial in further language 


differentiation developments. 
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VI. ONCLUSION 
The following are presented in relation to the findings 


herein: 


1. The IPA does appear to be an able vehicle to support 
a language differentiation scheme. However, the ROMI 
conversion alphabet should have been expanded to reflect 


diacritical marks across the languages used. 


2 Single sounds are definitely an inadequate 
differentiation approach. Two-sound combinations present 
only borderline results. At the three-sound point definite 
unigueness appears, and is improved with each additional 
sound brought into the combinations. The ending sounds 
(backdoor approach) begin to present patterns of interest at 
the two-sound point, and should have been explored at 
greater length. For the languages used herein (all 
European) it is generally concluded that patterns emerged 


which were unique in spite of the limited sample size. 


a The final analysis and decision model (computer 
program 2) is considered a successful tool for language 
differentiation. It appears to have considerable potential 


for expanded analysis. 


4. Perhaps the most important, though unexpected, result 
was obtained via the "loopback" testing of the model. This 
test pointed out the model was capable of pattern matching 
decisions of high reliability with small test samples (<5). 
This certainly presents an improvement to the technique of 
word matching, in that storage requirements and accessing 
times are greatly reduced. This potential is limited only 
by the ability to update the data base relative to current 
desired operability. | 
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APPENDIX A 
THE SPEECH SYST Edt 


When discussing the speech system, it 1S quite possible 
to delve into many scientific fields of endeavour in order 
to fully exploit the interaction and functions of the 
components. In an overview the players in a system diagram 
analysis (figure 1) are the transmitting section and the 


receiving section. 







RESPONDERS 
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TRANSMITTER RECEIVER 


Pigure A-1. Block Diagram of the Speech Systen 
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The receiving section is comprised of the hearing 
organ, the brain, and the responders. A schematic of the 
hearing system (figure 2) shows a subdivision into (1) the 
outer ear, (2) the middle ear, and (3) the inner ear. 
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OUTER EAR 


Fic. 75.—SEMIDIAGRAMMATIC SECTION THROUGH THE RicHT Ear (Czermak): G, EXTERNAL 
AUDITORY MEATUS; T, MEMBRANA TYMPANI; P, TYMPANIC CAVITY; 0, FENESTRA OVALIS; 
R, FENESTRA ROTUNDA; B, SEMICIRCULAR CANAL; S, COCHLEA; VT, SCALA VESTIBULT; 
Pr, scata TYMPANI; E, Eustacuian Tuse; R, Pinna. 
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Figure A-2. The Hearing Systen 
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The outer ear functions as an antenna, and both ears used 
together comprise a directional antenna system i.e., capable 
of determining the direction of the sound source relative to 
the listener. The outer ear also contains the feed system 
to the receiver. The middle and inner ear comprise the 
receiver itself. The middle ear senses the presence of 
sound and passes the mechanical energy to the inner ear, 
where it is converted to electrical impulses modulated in 
both amplitude and frequency. These impulses are then sent 
to the brain which detects the Tee Gert oe. and activates 
decision circuits to provide physical reSponse as necessary. 
The Eustachiam tube (E in figure 2) insures that there is 
equal pressure on both--sides of the ‘membrana tympani 
(eardrum). The Eustachian tubes also provide sone 
Side-tonal pressures whenever speech or sound is generated 
at its input (upper part of the throat behind the nasal 
eayity). This side-tonal effect appears to provide a 
negative feedback to "pad" or reduce the effects of the 
amplitude of sound waves emitted when they arrive at the 
receiver input of the "transceiver" system. An analogy to 
the "TR/ATR" action in a radar is certainly suggested by the 
Side-tone effect. The functions of the receiving’ mechanisno 
are those for which a mechanized language differentiation 
model is desired. The available technology does not dictate 
a direct approach to the solution, however, the basics are 
clear in the "blackbox" approach. The output is a response 
resulting from a decision based on the input received. The 
type of decision device, must then be determined from what 
information can be gleaned from the input. The input is 
assumed to be an exact duplication of the output from the 
transmitter section. Therefore, if it can be determined how 
the transmitter functions based on observable output, it 
seems feasible that some model could be developed to 
interpret the information the transmitter would ‘send 
Subsequently. 
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The transmitting section is of much greater interest 
and value to this research. Basically, as seen in figure 1, 
the brain sends responder messages to the lungs and vocal 
shaping components. The lungs provide the mechanical energy 
which is shaped, amplitude and frequency modulated, and 
emitted as speech or vocalization. Thus the transmitter is 
the human vocal system. A block diagram of the system 
(figure 3) gives the breakdown of the vocal system into four 
subdivisions. The schematic diagram (figure 4) gives the 


anatomical layout of the components. 
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fig. A-3.a. The "Black-boxr" Approach 
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fig. A-3.b. The "Black-box" Expansion 


Figure Aa-3. Human Vocal Systen 
{block diagram) 
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The lungs act as a power supply, collecting air such 
that it can be expelled with the required volume and 
velocity whenever dictated by the brain. Both the voluntary 
and involuntary actions by the lungs are used in the speech 
process. Voluntary action is very apparent in the "raised" 
voice or "yelling" process, where there 1S an obvious 
increase of pressure on the lungs forcing the volume and 


velocity out of the lungs to be increased. 


The oscillator and resonant cavities consist of the 
glottis (vocal chords), the pharyngeal passage (upper 
throat), the oral cavity (the mouth), and the nasal passage. 
The glottis, when excited by an expulsion of air, vibrates 
or oscillates at a fundamental frequency (male average is 
130Hz, female average is 205Hz) (Ref. 18}, and generates 
Many harmonics. The frequency actually emitted may be the 
fundamental, a harmonic, or some combination. In general, 
when sounds are emitted there is an amplification of some 
parts of the spectrum and suppression of others, much like 
the characteristics of a bandpass filter, i.e., selective 
transmission. The fundamental frequency is a result of a 
number of factors, namely the size, shape, and elasticity of 
the glottis, the circumference, length, and shape of the 
trachea, and the size shape, and depth of the oral and nasal 
cavities; specifics for typical dimensions and frequencies 
relative to these paramenters can be found in appendix eB. 
In the aggregate these factors constitute a tuned waveguide 
and/or cavities which tend to alter the basic frequency of 
the glottis vibrations. In support of this, consider what 
occurs when the velum is opened (by its trapdoor action) to 
allow air to flow through the nasal cavity; the volume of 
the system 1S increased which is consistant with a lower 
frequency of resonance normally attributed to the nasalized 


tones. 
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The pulse shaper provides the ultimate control of power 
during constant output. The shapers consist of the tongue, 
teeth, lips, jaw, and palate. The shapers affect both the 
frequency and amplitude of the transmitted waves and thus 
act very Similiar to a modulator section in a radio 
transmitter. Frequency variability occurs in the range 
available from the fully open to the fully closed mouth, and 


has numerous descrete frequencies in between these extremes. 


The antenna or radiating system consists of the actual 
oral and nasal orifices. The lips and jaw both work 
independently and together in the shaping, frequency, 
amplitude, and emission operations. The parameters of 
importance in the output are again frequency and amplitude. 
The sound wave emitted will be of higher frequency when the 
mouth is more in ‘the closed position than open. The affects 
of the nasal chamber is controlled by the velum, which 
determines the amount of air (by volume) allowed to pass in 
the cavity. The nasal-oral combinations regulate the output 
frequency by combining the orifice controls of the velum and 
the tongue position. These combination sounds occur. nore 


frequently in languages other than English. 


RECOGNITION OF SOUNDS 


Every emission via the vocal systen falls into 
categories which have been previously developed and 
quantized in varying degrees. Numerous experiments have 
been conducted particularly in the speaker recognition area 
where the unigueness within these categories are further 
quantized to search out factors which are undeniably speaker 
dependent, and thus provide keys to recognition algorithms. 
Doherty [Ref. 19] used 25 significant factors in his Long 


Term Spectra (LTS) analysis to achieve 100% recognition (a 
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rarity in recognition results) for a sample size of 50. 
Doherty did not provide a specific breakdown of the 25 
factors he used, however, he did indicate that various 
combinations of frequency, times, nazalization, pauses, 
stops, and fricatives were some of the factors used. Others 
such as Young [Ref. 20] have used a single category of sound 
to promote recognition schemes. Young looked specifically 
at nazalization and achieved results from 50% to 97% with a 
sample size of 13. The categories are relative then to 
recognition of many entities in speech. Word/phrase 
boundaries as well as the individual components and sound 
patterns can be identified and removed for analysis. 
Experimenters make every effort to eliminate odds whenever 
possible, therefore, it is not uncommon to find that when a 
project is dedicated to a specific area of sound to research 
that test segments are carefully screened to be 
representative of the specific area. Sounds can be 
classified by a number of parameters (time, frequency, forn, 
etc.) One such classification technique categorizes sounds 


by form, i.e., by the way they are generated. 


The categories of concern are: the vowels (v), the 
stops (s), the nasals (n), the fricatives (f), and the 


pauses (p). 


The vowel sounds are that group which flow smoothly 
(uninterupted) from the lungs through the oral orifice 
utilizing the whole of the vocal cavity, with minor 
adjustments in resonance promoted by the specific location 
of the tongue. Vowels are further defined according to the 
tongue position with respect to the oral cavity (mouth) in 
general. The front vowels are generated whenever the tongue 
position toward the front of the mouth is the major factor; 
the sounds specific to this location are: 1(HF), I(LHF), 
e(HMF), E(LMF), ew(LF). The back vowels are: u(HB), U(LHB), 
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O(MB), o(HLB), a(LB). The central vowels are: JS(MC), s(MC), 
3(MC), a(MC), a(LMC). In addition to the distinctive vowels 
noted above are a group of connected vowels whose separation 
are said to alter the meaning, therefore they are taken 
collectively and have the special sub-categorization name 
"diphthongs." The diphthongs are aI, aU, Of, and ju. 
Diphthongs are also referred to as the vowel-glides since 
there is an alteration in the resonance which resemble the 


glide consonants. 


The stop category consists of those sounds which are 
emitted as a 'puff-of-air, i.e., no energy is followed by a 
large spike cf energy. Stops are also known as plosives due 
to the energy spike, but stop is more commonly used _ since 
the speaker must actually establish a stop in the snooth 
expulsion of sounds to generate the plosive burst. The stop 
category has the components: p, b, t, k, and gq. A 
sub-category of the stops is the affricates tJ and d3 which 
are again combinations which cannot be reduced without 
changing the meaning, therefore, they are classified as 


unique and essential. 


The third group is known as the fricatives. These 
sounds require a velocity change which is achieved by 
creating a narrow slit between the upper teeth and the lower 
lip, the upper teeth and the tongue, or the upper teeth with 
the lower teeth. The objective is to create ae higher 
frequency sound at a higher velocity. The components of 
this category are: £, V, 68, Gy Sse Z, ve 5° and h. At 
present there are no apparent sub-categories which are 
considered unique. 


The nasal category actually consists of glides, nasals, 
and laterals. The glide is so called because OF 
articulation organ movement required in its generation. For 


example, note that when the single sound of "r" is generated 
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the oral cavity enlarges, Boe eeacarinity note that the tongue 
drops in position. Enlarging the vocal cavity tends to 
lower the frequency of resonance, such that the sound is 
readily identifiable as a smooth transition from a high toa 
lower frequency. The nasal sounds are those sounds which 
are emitted via the nasal cavity only, that is the mouth is 
blocked by a tight closing of the lips for the sound of "n," 
and by the tongue pressed firmly against the hard palate for 
the generaticn of "n." The final component of this 
sub-category is generated when the mouth is blocked by the 
center and back of the tongue to form the sound associated 
with Hy: The lateral has a single component "1." The 
lateral utilizes both the nasal and oral cavities for the 


generation of this glide-like sound. 


The final category is that of pauses. A pause is an 
obvious lack of sound of sufficient duration so as to 
provide a well-defined boundary for a word or phrase as 
appropriate. Pauses really represent a special situation 


that regqyires a more indepth discussion. 


The pause is that period in speaking which is denoted 
by the total absence of sound. In writing one provides word 
separations cr boundaries by the absence of script or by 
leaving an intentional space. Unfortunately this does not 
follow in speaking. In speech, words ‘are run together such 
that a combination of words (hereforth defined as a phrase) 
would appear as a single utterance which is devoid of 
pauses. For example, the phrase "the brown cow" is spoken 
as "thebrcwncow." In a transcription from the text the 
phrase becomes "dq braun kau," but the spoken appears as 
"Sabraunk u." This draws forth the problem of sound pattern 
Significance for the two cases described as (1) with and (2) 


Without pauses... 


It appears as though a pause may or may not be present 


a2 





oy 








with equal probability. Im the case of English it is 
observed that three distinct areas enter into the study of 
pause insertion. Speaking generally for the JU.S.A., the 
Northeast coast population speak very rapidly and pauses are 


difficult to detect; Southerners speak very slowly, so that 


pauses are dasitanet and occur more often; 
Westerners/mid-westerners fall somewhere in between, 
representing a median for the extremes. The actual 


detection of word boundaries via the spectrograph or 
Sipiliar process has not been achieved with any degree of 
accuracy, and is particularly poor with an unknown sample 
{[Ref. 21}. Therefore the problem must be looked at frem two 
points of view. The first looks at the sound patterns based 
On individual word list inputs. The premise here is that 
the probability of any word beginning a phrase could be 
high. From the words of the phrase "the brown cow" one 
would certainly expect the article "the" to follow a pause 
With a high degree of certainty. The word "brown" could be 
given a probability assignment of .5 (equiprobable that it 
would or would not begin a phrase with some certainty). The 
word "cow" would have a much lower degree of certainty 
associated with it, but greater than zero, since words such 
as “cowboy,” etc. exist. A second argument would be that 
only phrases (devoid of pauses) exist and they therefore, 


Should be the only sound strings used in the analysis. 


The solution to these points as far as this thesis was 
concerned was to look at both words and phrases, 
individually and collectively. The rationale being that 
Since the analysis only looks at the first four and last two 
of the scund string, and since there is’ sufficient 
probability that any given word may be the first word in a 
given string (phrase), that the characteristics of that 
individual word are very germane to the sound string 
analysis. The use of phrase analysis, recognizes that pause 


detection may be very applicable in the conversion of spoken 
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‘ 
sounds to a computer-acceptable input. 


A series of sound emissions can be observed either in 
whole or in part wherein individual sounds can be sensed 
categorically or uniquely, dependent upon the type of 
analysis which is desired. In the analysis by Young [ Ref. 
20} he was able to concentrate on the nasal phonetation by 
first carefully selecting words (including nonsense words) 
which clearly contained Significant nasal sounds, then via a 
masking or filtering technique he removed those quantities 
which were not of concern to hin. Others have used a 
variety of masking techniques to eliminate all but the 


desired elements of the test speech. 


The concern herein was not to extract any sounds or 
rely on the unigueness of sounds for the purpose of 
recognition. Rather the intent was to establish whether or 
not sound patterns are unigue to languages, on either an 
individual or a categorical basis. In order to achieve this 
end it is necessary that sounds can be uniquely defined 


categorically. 


ELECTROWIC PERCEPTION, OF SOUND 


Many attempts to fully comprehend the operation of the 
vocal system have been made through thé ages. A large 
number of models were constructed across time, many were 
unsuccessful, while others proved major contributions. [n 
the late 1700's three mechanical models were independently 
produced by Kratzenstein, VonkKempelen, and Wheatstone, each 
of which used bellows, leather resonators, and reeds to 
create fricative, plosive, and vowel sounds [Ref. 18]. The 
first real electrical model of any value was the "VODER," 


(VOi1ce Operation DEmonstratoR), developed by H. Dudley in 
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the 1930's, (Dudley also developed the first VOCODER: VOice 
CODER; also in the 1930's). The VODER consisted of 
oscillators, resonance chambers and a keyboard to _ control 
the frequency, amplitude, and duration of each sound in the 
formulation of words at its speaker output. The YODER had a 
vocabulary of 2500 words [Ref. 22, pp. 16, 32, 49, 191]. 


The spectrograph is an analysis tool which came into 
being in the 1940's as a result of studies by Koenig, Dunn, 
ema Lacy [Refs. 3 and 23}. There are various types now 
being used, however they have common operations in that all 
take in speech sounds, filter separate the sound into bands 
(bandpass filtering) and display the results on a frequency 
(vertical) versus time (horizontal) basis. The spectrograph 
uses basically a Fourier analysis processing [Ref. 3]. One 
motivating factor behind the development of the spectrograph 
was the desire to have a visible speech mechanism to aid the 
learning process for deaf persons. Many side uses. were 
subsequently developed for the spectrograph. Peterson and 
Barney used the spectrograph technique to confirm that most 
English sounds occur in the standard telephone bandwidth of 
300-3300HZ (Ref. 18, p. 154]. Spectrograms have been used 
as a means to develop speaker identification through 
"vyoiceprints." Voiceprints are unfortunately not at a state 
where 100% accuracy is consistant with experimental results. 
Therefore, voiceprints promote (presently) a shadow of doubt 
in their reliability and are not accepted as a legal tool. 
The voiceprint area is one of considerable interest to the 
speaker recognition subgroup of the Acoustic Society of 
America, and was the subject of attention during a special 
open discussion during the 90th meeting in San Francisco 
{Ref. 9}. 





APPENDIX B 


Some Characteristics of Vocal Tract Tramsmission 








ORMANT = manifestation of the normal modes as_ spectral 


peaks in the output sound, which are derived fron 


the transmission function relating the mouth 


glottal volune currents. The transmission 


function is: 


¢ 


H (jw) > pele = erie = aia tee = --47- 
al j sinhal cosh(a+j9)1 cos(“ 





) 


where = w/c and w = (2n + 1) 


Formant Parameters: 


La = x = acoustic inertance/unit length 

Ca = 3¢r = accoustic compliance/unit length 

Ra = =, wir = acoustic resistance/unit length 

Gq = es to = acoustic conductance/unit length 


A = tube area = coud = 0.775 ine 

cr = radius of tube = 1.26cm = 0.497in 
S 

p 


= tube circumference = 7.9265cm = 3.12in 


Beair density = 1.14 x 10 3qnyca2, 
(moist air at body temperature = 37 C) 
C = sound velocity = 8.5 x 10*cm/sec, 
(moist air at body temperature = 37 C) 
H = viscosity coefficient = 1.86 x 10 * dyne-sec/cné, 


m0 C, 0.76 in.Hg.) 
» = coefficient of heat conduction 
= 0.055 x 10 3cal/fcm-sec-deg (0 C) 
7 = adiabatic constant = 1.4 


Cp = specific heat of air at constant pressure 
= 0.24 cal/fgm-deg (0 C, 1 atmos.) 
1 = tube length = 17cm = 6.6931in (for avg. male) 


f= = “= = a(11141) 
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The 


< 
MN 


x « a « 


formant frequencies as computed are: 


ee irae 4) 
(2n + 1) (3230/sec), 


5 Agile 2 2 0 


3230 radians 
9690 radians 
16160 radians 
29070 radians 


rh Fh Fh OF 
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(2n + 1) (gS-) 
(2m + 1) (5174.4), 


n=0 7.1 gales ona 


514.4 Hz 
1543.3 HZ 
2571.9 Hz 
4630 Hz 





APPENDIX C 


TRANSORIPTION SYABOLS 


ROMI to IPA Conversion Tables 


Table C-I. Fricative Conversion 


ee a ee ene eee ee 





ROMI IPA 
PF f 
V d 
Q 6 
T-H 3 
S Ss 
Z zZ 
S-S j 
2-2 . 
H h 
Q-A $ 
B-B 8B 
C-H € 
S-H S 
2-H Z 
X x 
Ge F g 
X-X x 
R-B b 
H-H h 
0-8 J 
Z-C z 
J-A 5 
G-B Y 


SS ee Re ee ee ae a es Bee ee a: EE eee ee See eee Se ee eee SS SS SS SS SO ES SS 
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Nasal Conversion 


Table C-II. 


IPA 


ROMT 


Ww 


Seman. oe se ig Se et S&S as 


vo 
= > i Og my 2 Ol =z N © WwW & 
1 ! 1 ‘ ' ' ' ' | ! ' ' 
ZB2 2wrAHeme He eye BH Be HR KE MS HY UCU 


np ee 68 ee eee ee ee ee eee ee a ce eee ee ee eS eee ee 
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Stop Conversions 


Table C-III. 


IPA 


ROMTI 


ie Se SE oo ee a gE eee ee oe eee OS ee eee eee ee ee a ee ee ems ee ee 


a YP ADM MDM YV 


‘ 
aH MaA M YU YU 


~ 450 OW 


mRAH A O Ob = 


FPRICATIVES 


a a ee wee oe SS 


A 


tf 


Eee ee ee a ee ee ee ee ee ee ES See eS 
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Vowel Conversion 


Table cC-IV. 


=  : : ya  emm SSUCUe mm e — 


IPA 


ROMI 


| se oY Ww % 


oe @ © Ss 39 =e ™ 


= fx] eo) mn > H 
' ' ' ' ! 1 
HHA wk @ ec OU oO Dp bp FP 2 


a, (&) 


Woe QmrAm 6 Bw Ba 


PHTHONGS 





tH 


D 


al 


ju 


au 
ou 
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Table cC-~IV. 


(Gone. ).: 
ROMI 
19) 
E-Q 
A-R 
i 2 
U=S 
E=5 
zZ-0 
U~Q 
C-Q 
bo 


DIPHTHONGS 


IPA 


NS a em Se ee a rcp cam em eS 
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APPENDIX D 


The data and graphs contained in this appendix were 


used in an attempt to achieve a decision model to 


differentiate languages. 


across the sound rows are normalized to 100; 


In each language the data reading 


the columns 


~<, 
were not normalized, however, the columnar data were used to 


compute the mean E(x), 


graphs were generated in accordance with formula (4). 


Combined Vector Data 


and the standard deviation o(x). The 


Pa 4 ae, ede ee eee eee ae EE ee Ge st SSG ee Se le as ee  “ a eee See a eS eS 


a c-Si > a a ar a Ee eS Se ee ae 





lE2= 3 


1-2-3-4 





SS ee ee ee a ee ee ES es ne ee 


SP Sa ee e-em eee SS SE 


Table D-I. 
PF N 
33.48 16.29 
0.98 7.84 
17.46 45.24 
15. 46 54.63 
1.35 7 Se 
13.75 28.31 
TW. 22 18.21 
28.57 16.49 
4.55 33.39 
9.80 25.49 
7.31 14.63 
24.14 20.69 
14.93 23654 
9.71 8.85 
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Taner D-I (cont.): 





3259 
Da22 


li2eua.S 
6/07 


Bi2- JD 
36m 17 


29.32 
5.48 


23.65 
16.67 
J 2ee | 8 








19.92 
14.17 
tes 


170 
141 





OS — A 


= a cc ee me em eae Et ee ee 


= ED SS Ee ee ee Se ee eee ED ee ee ee ee SS ee See ee a a ee ee one 


C 1 

C V=2 

C im2=—3 

C i 25-0 
© n 

S n-1 

c E (xX) 

Cc o(x) 

D 1 

D 2 

D 1-2-3 

D lee — 3-4 
D n 

D n= 1 

D E (x) 

D O(x) 

E 1 

E l= 2 

E i 2~3 

E n— 2- 3=4 
E n 

E a 1 

E E (x) 

E O(x) 
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Using the data from table D-I and the following 
information from table 6.1 of Lipschutz[Ref. 17], points 
were calculated for the magnitude coordinates at 0, +], +2, 


and +3 standard deviations, for each category: 


0= 0.4 _ t1 = 0.242. +2 = 0.054 +3 = 0.0044 
a(x) of (x) O(X) of (x) 
Table D-II. Fricative Magnitudes 
~3 od -1 0 1 z 3 
apx =-16.91 -6.69 3.53 13.75 23297 88.19 i .41 
A,F (x) -00048 .005 024 - 039 024 -005 0004 
A, 0(X) 10.22 
bee A —152 = .009 5.22 14.93 24.64 34.35 44.06 
Bao (Xx) -0005 .006 ~0Z5 - 041 025 -006 -0005 
B,o(X) 9.71 
Cryx =—55685 O2916 onl? i2Zov6 18.99 24.2 30.21 
Cex) -0007 .009 040 067 040 - 009 -9007 
C, o(X) 6.01 
Dx -4.82 1.99 8.8 15.G@N 22.02 29.23 36. 04 
Dy, Les ) -0006 .008 - 036 059 “U0S6 -008 -0006 
D, O(X) Soo" 
By Xk =12.05 ~22.91 6.23 15237 261° S3765 2.79 
Eye (X) -0005 .006 -026 O44 026 -006 -0005 
Ee a(x) 9.8 


a ee ee ee ee ee ce ee ee ee ee Se 
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Table D-III. Nasal Magnitudes 


a a a a eee ESAS SS ELE SE SS Se a a <a e 











-3 -2 - 1 0 1 2 3 
A,X -26.32 -8.11 10. 4 230031 46.52 64R73 ‘G2agu 
A,£ (x) ~-0002 .003 ~013 ~022 ~013 ~003 ~90002 
A, O(X) We ..2 17 
B,X -3.21 5.64 14.49 23.34 32.19 41.04 49.89 
B,f (x) -0005 .006 . 027 ~O045 027 006 ~9005 
B, o(X) 8.85 
€ ,x g42388 48.36, 23.84 29232 384.80 @0-28 45.76 
C,f£ (x) -0008 .01 O44 eZ O44 O07 -0008 
C, o(X) 5.48 
D,X 6.85 12.5 18 15 23.80 29285 35.10 &0.75 
D,t (%) -9007 .010 2043 Ot 043 -010 -0007 
D, o(X) 5.65 
E,i 2.99 8.96 14293" 20.90” 26987 32.54 § 36.64 
E,£ (x) -0008 .009 2-041 - 067 ~041 009 -9008 
E, O(Xx) 5.97 


> <a ce es ee ee ee ee ee ee ce es es es ee a ee es ee es es ce eee ee ee eee eo ee ee ee es 
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Table D-IV. Stop Magnitudes 


—_— ee ee ss ee eee eee 





-3 -2 ~1 0 1 2 3 
A,X -25.44 -12.32 0.80 13592 27508 440.16 53.28 
A,f (x) -0003 .004 018 024 048 -004 -0003 
A, O(x) T3412 
Bp, Xx -24.72 -10.67 3.38 17.40 «Slee 45.53 59.58 
B,t (x) -90003 .004 017 028 e017 004 -0003 
C, O(x) 14.05 
©, & =-4.68 201495 8.98 1Ss8t 22360) °29.47 362580 
Cy (x) -0006 .008 2-035 -059 «0285 -008 - 0006 
C, O(X) 6.83 
Dy, X 4.02 8.69 13.8% W6203 22.70 27438 — 32.04 
D,f£ (x) -0009 .012 ~JUS2 - 086 «Wo 2 012 -0009 
D, O(x) 4.67 


E,X -2982 6828 W668 23536 Sa62 doBye Hermon 
E,f(x) .0005 .006 .028 #.047 #4.028 #.006 =~ #§=.0005 
BE, (x) 8.56 
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Table D-V. Vowel Magnitudes 

~3 = 2 -1 0 1 2 3 
A,X ~Q7.72 —-17.784 13.94 44.02 74.60 105.18 135.76 
A,£(x) .0001 .002 -008 ~013 -008 -002 -0001 
A, O(X) 30.58 
B,x -16.16 3.99 26. 08 448529 64.464 84.59 04.74 
Beedz) .0002 .003 On 2 -920 o@12 -003 -0002 
B, o(Xx) 20.95 
C,% Vtaes 20871 32.19 “S@.0) S385 8639563 Ter tA 
Gyetixy .00048 7005 «023 -038 ~023 ~005 -0004 
C, g(x) 10.48 
D,X =§25936 11295 28.26 48.57 60.88 77.19 938.5 
D,f£(x) .0002 .003 -015 «025 -015 .003 -90002 
Dy of (X) 16.31 
E,X -11.06 5.91 22.898 39.85 96.02 73079 90. 716 
Byt(x) .0003 .003 2014 -924 -014 ~003 -0003 
E, O(Xx) 16.97 

ra 
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PERCENTAGE OF WORDS WITH FIRST SOUND AS NOTED. 
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COMMENT Hee he Be ea crmeire eae mee ete Ree ie eco Ig ee eRe ie hehe Se oe ee ste Se akc ate ae Bc ok che ste rae ee 
PROCEDURE GENERATETWO PROVIDES THE 16 TWO-SOUND CL&S8S LABEL 
DESTGNATICNS IN BO cay” QFNER, ines LAB8tS ERS 
STORED IN THE QNE OIMENSIGN ARQ AY NSOUNDTHO. THE Ages 
CORRESPCND TO THE 15 ROWS OF THE SET OF REAL ARRAYS NAMEN 
Mani=tTWwO *,"FTRSTLAST_", mele ee 2h 

be ae ope he ete she She sex a aie ake ate oh ote he he ate He ote See Se tek she Fe Xe Te the ak she hk ste che hc ic she te oe ae He He he Se oe ae He ee oe a Ht 
PROCEDURE GENEPATETWOA: 

BEGIN 

MAP EGER Y; 

hae O05 

moet 6S = 9 UNTIL 2.90 

BEGIN 

Boe 3 -= O UNTIL 2 90 

BEGIN 

SeeOrWwOCY) <= 8 "5 

Seo res (YF)CO}1) t= CLASS €1)0911)3 

momerorwO (VYFICLIJ1) <= CLASS (J)00]1)5 

mae YY + | 

END ; 

END; 

END GENERATETWO3 


COMMENT XR RR REKARARK KARR RHKAKE RAK EAARAE KK EAR RARKKEKAAAKSE 
PROCEDURE GENFRATETHREE TAKES THE 15 ELEMENT ARRAY 

"SOUND TWO" AND GENERATES THE 64 POSSIBLE COMSTNATIONS Qe 
THE FIRST THREF SOUNDS OF A WORD IN ALPHABETICAL nance, 
THE €4 FLEMENTS ARE STORED IN wer ONE OIVMVENSTONAL ABPAY 
“MSOUNODTRPREE" AND ARE YSED AS LA ARELS FOR THE 54 ROWS GF THE 
SET OF REAL ARRAYS NONETWO3 . 

keke KRKARAL SAKE KARA EKHEH F ehae KeEAKRARAKAERALA AA Hk KE HHA: 
PROCEDURE GENERATETHREEFES 

BEGIN 

INTEGER Y35 

veut O04 

POe 1 *= QO UNTIL 15 99 

BEGIN 

meee == 0 UNTIITL 3 30 

BEGIN 

SOUND ITAHREECY) <= © &5 

SOUNDTHREE(Y) (012) c= ean 

POO IRREET YI (CZTL) 2] CLASSUSJ) CIT): 

Ye := Yi 

ENM 3 

EMD $ 

END GENERATETHREE 

PROCEDURE GENERATEFIUR? 

BEGIN 

INTEGER Y3 

Y2=0: 

FOR ~— s= O UNTIL 63 99 

BEGIN 

FOR J := O UNTIL 3 30 

BEGIN 

SOUNDFOURTY)sat te 

SOUNNDFOURTY) (073) c= SAUNDTHREE(T)(GI3)3 

SQUNDFEFOQUR(YI( 37 1LIZ=HATLASS( JD COP LI: 

Y s= Vet 

ew) < 

END; 

END GENERATEFQUR; 
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PROCEDURE GENERATESIXS 

BEGIN 

INTSGER XyY¥3 

rac O° 

foe = -13 

ee 1 2=0 UNTIL 255 dO 

BEGIN 

FOR J := O UNTIL 2 DC 

BEGIN 

X 2s=X+13 

SeemOrFiTVE(Xj:;=" ©; 

ET) zeae intone 

SOUNDFIVE(X) (411)3=CLASS(J) (OIL); 

meee © t= O UNTIL 3 20 

BEGIN 

SOONDSTXCY)s=" 3 

MM el) ccemecievtoiiie 

SGUNDSIX(Y) C511) 2=CLASS(K) (O11): 

‘a1 *)] 

END; 

END 3 

END; 

END GENFERATESIX3 

UN ye he te A che ee ate hee eee he epee ee ee hc ee Se oe oe ee ee ate ee ee eee ek 
PROC ENURE GETARRAY 8 EADS IM THE 101 CHARACTERS US=D 3BY THIS 
PROGRAM TO REPRESENT THE INTERNATIONAL PHONETIC ALPHABFT. 
EACH SQUNOD REPRESENTATION TS PLACEN IN A TWO COLUMN ARP AY 
"QOMI," COLUMN INE ITS THE SQUND REPRESENTATION AND Be 
TWO CONTAINS THE CLASS: EITHFR NASAL, FR ITCATIVE, STOP, AR 
VOWEL. Tee NUMBER GF CARDS USED TO SEAM IN “RGM eer 
BE VARIED WITHOUT ALTERING THE PROGRAM, THE MIIMBCR OF 
ELEMENTS IN "ROMI" YS NOT VARTABLE. NROMTM TS PRINTED OUT 
moore OAT. A TIME, JUST AS IT_TS TYPED IN TRE AATA CARDS. 
He hee teak he ate ae he ate see se a he te Be he ee ae ee HR de oe oh he oe a ae ee he ote oo ee oo he oe eo ek ok ek 
PROCEDURE GETARRAY3 

BEGIN 

STPING( 81) PHONOL? 

INTEGER 8,A3 

Bs=15 

A := 13 

WHILE (A C= 27) DOA 

BEGIN 

Peek s= ms 

READCARD (PHONOL)3 

fone = ()¢ 

MHILeE(TP <= 79)90N 

REGIN 

WHILE (PHONDL (TPI1) =" ") AND (TOK=79) OL 

TPs= TO+#ls 

IF ( TP w= 8CQ ) THEN 

BEGIN 

TF (PHONOL (TPF1 41 )ra="-'") THEN 

BEGIN 

ROMI (8,1): = PHONGL (TPI1)3 

TO: = TP+#?s 

END 

ES 

BEGIN 

ROMI (B,1L1)(0l1)2: = SPHNNOL (TPI1): 

ROMI (B@,1)(1J 1): = PHONOL (TPFH2I1) 

TPs =TP+43 

END; 

ROMI (8,2): = PHONOL (TPI1)3 

Bs = Rel3 

fe <= Pe] 

END 

EESE 

Apt = A+13 

END 


LS5 
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SATIN], | De Ae Se He arcane eae See he ae ae ote ee ee eee ae a He a 2 oe 
LOGITAL PROCEDURE GETWOIRD BEADS THE INPUT DATA WPITTTEN ON 
eee «INT & BUFFER CALLED “TEXT,” ONE CARO AT A TIME AND 
ECHO PRINTS EACH CA2D WITH A SEQUENTIAL NUMBEO [DENTISETER 
men yexX .* Bee e ries “Text, TSeSGANNEDSOGNE CHARAETER AT 
mo Peee UNTIL A “WARD” IS BUILT. LEADING AND TRAILING 
BIeNKS DELIMIT A “WORD.” "WORD" ITS THE GLOBAL PARAMETER 
NF STRING TYPE OF MAXIMUM LENGTH (Cf 36 CHARAT TERS WHICH 
IS DELIVERED TC THE MAIN PROGRAM AND SURSEQUENT PROCEDUREAS. 
Mie THE “CHARACTER 2" TS GNCOUNTERED ON A DATA CARD IM 
OTHER THAN CARD COLUMN 1 THE LOGICAL VALUS OF "GETWORD" IS 
wee) =SALSE THUS SIGYALLING THE END OF A DATA SET, 
"WORDS" MUST START AND END ON THE SAME CARDO. 
Meee Me He Ae ate te ke ee oe He he He oe He ee eo Pec he He eH ee He Hoste she ok fs ae oe He ee ee 
LOGICAL PROCEDURE GETWORD (STRING( 326) RESULT WARD); 
BEGIN 
wee c= 0: 
TF (TP=80) THEN 
BEGIN 
BPE Xs=I1NDEXF1: 
REM OC ASD (TEXT): 
TP: =0 
END $ 
WHILE(TEXT(TPJL)=" ") ON 
8EGIN 
ea tP+1; 
IF (TP=80) THEN 
BEGIN 
Re ADCARD (TEXT); 
INDEX: =ITNDEX+#1 3 
TP3=0 
END 
END; 
VWOROs=" UR 
WHILE (TP<80) AND (TEXT(TPITL)-~=" ") DO 
BECIN 
WORD (WPI1):3:= TEXT(TEI1L): 
WO s= WP + 13 
im ae TP + 1 
END 3 
WP s= WP-1;5 
WORDa=Ig" 


END GETWOPRD; 
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i ra ieaeer Viglen ee <tc hoe ee ee eae ee ee ee one ee eee ee cee eee ee ee ie ee She cee ane oes eee oe he Se he Acre ete Be he oe he ee oe 
PROCEDURE PROB_TW) CALCULATES THE PROBABILISTIC PROGRESSTONS 
eee tite PPPS. WHEN CALGULATED, ARE REASSTGNED 10 THE 
CORRESPONDING BASE DATA FREQUENCTES ARRAYS. THE CALLING 
ARGUFMENTS ARE: 

Mme oer ARPAY BASE: CORRESPONDS TQ THE CUMMULATIVE 

Sewn COMBINATION FO@2 WHICH THE PP TS REING CALCULATED. 

2) REAL ARRAY PARTPRIBS THE ARRAY CONTAINING THE 
NCONDITICNAL™ PORTION OF THE PD, 

eee ARRAY UTP UTE THE ARPAY, JO WHICH THE PePtS ASE TN 
ae Ono CURT MUST SS PRAENSPENWED TEFNTICALLY WITH 
Zee: )6|6C Th HIE NUMBER. OF EREMENTS IN PWRTPROB LESS ONE. 
Pema Os THE NUMBSe GF eELFSMENTS IN BASF DIVIDED SY "HE 
MOUMBER CF ELEMENTS. IN PAPT PROB LESS ONE. 

Mee ae tech he ake ek cee oe He ea oe He ke eke eK ek eek Rk ee aK A ERR KK 
PROCEDURE PROB _TWO{REAL ARRAY BASE( Xe) SREAL ARPAY OCARTORIAB 
eee sREAL ARRAY OUTPUT (*,=)S INTEGER VALUE LO »FNIDI 

BEGIN 

REAL B;3 

ieee Te? s 

mee Be t= 0 UNTIL LOWON 

BEGIN 

FOR [. := 9 UNTIL FNID OO 

BEGIN 

B := 0; 

oe= Tei 5 

POR Jis= |] UNTIL NUMBCR 

B := BtBASE(T,J)*PARTPRNIR (YH, J); 

FOR K s= 1 UNTIL NUMBEP 

BEGIN 

IF(Bx=0) THEN 

eee 8) 2= BASE(T,K)*PARTPROB(H,K)/8 

OUTPUT (T,0):=-1 

END $ 

END ¢ 

END; 

END PROB_TWO3 
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E DECIDE 2E€ADS TEST SAMPLES, ANALYSES THEM AND 

A SET AF_NUMBERS TO THE ANALYSIST FOR A DECISION. 
SIZE" 1S THE NUMBER OF WORDS IN THE TEST SAMPLE 


ES meMe ee OE WORDS IN A TEST SAMOELE FOR 


eee, OWHICH IS LIMITED ONLY BY 
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COMMENT#*** WORD LONGTH = ONE**=* 


BEGIN 

T3=03 

FINDSOUND(9) 3; 
CNEPOINT:S=POINTER? 

FOR ~T 3:= 1 UNTIL NUMBER OO 

Dee (S'SQN(0,1) s=ON=1T( POINTER, 1) 
END $ 

ChMMeENT**s* WORD LENGTH = TWO*est% 5 
BEGIN 

aes 

ONEPOINT 2:=POINTERS 

FOF ITs=1 UNTIL NUMBER DO 
DECTSION(0, 71) 3 =ONS2(POTNTER,I)3 
FINDSOUND(1)5 

POINTERS =ONEPOINT#44P0INTER: 
Perko Te =F UNTIL NUMBER DO 
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COMM EN T 2% 2k 2 oe oe ee te eo ee ee tee oe i ee i A ook ee ee ko ee ok ee ek Ke ok kK 
PROCECURE RAW_ANALYSIS USES AN IDENTICAL "CASE" STATEMENT 
TG THAT USED IN "DECIDE" TO PARTITION THE DATA BASE SETS, 
RAR RR Ke a RR ERE ERR RK RK KK KS 
PROCENURE RAW_ANALYSIS3 

BEGIN 

INTEGER P3 

IF(R8AD=TRUE) THEN 

BEGIN 

POINTER :=0; 

IF (Z7<=5) THEN Pt=Z4+1l ELSE P2=6;3 

CASE P CF BEGIN 


COMMENT ##**** WORDS WITY ONE SOUND *2x#%* 5 
ONE SCUND(GNE1 ,0,4)3 


COMMENT ee *WORDS WHITH TWA SOUNDS **%% xXx 5 


BEGIN 
ONES OUM 


ND 
ONF SGUND 
END: 


Serre) =xeee-ynR OS WITH THREE SMUNDS ¥¥eaexx 5 


REGIN 

ONE SGUND (ONES 9,4) 3 

ONE SQUND(ONETWO3~-1,14) 3 
ONFSCUND(ONETWN33,2,6%) 
END; 

COMMENT #222 =WNR0S WITH FOUR SQUNDS ex 5 
BEGIN 

ONFSOUND(DNE4S ,0.4)3 

ONE SOUND (QNETWNG,1,16)3 
ONE SOUND(CNETWO34,2,64)3 
tee oO BOSSE 39296) 


See ess ee WORDS WITH FIVE SOUNDS *s sax s 


PEGIN 

ONE SQUND(CNES ,0,4)3 
ONEPCINTs=PO0OINTERS 
ONESCUND(ONETWA5~,1,16)3 
ONFEFSOUNC(OQNETWO35.2,64)3 

ONE SQUNC(ONETWOA 3245,3,256)3 
ONS SOUND (CONE TWP 324E65,4,10274)3 
POINTER: =CNEPOINT? 

ONE SOUNN (CFIRSTLAST5,4916) 
EIN) 5 

COMMENT exe eWNQOS WITH SIX €& GPZATER SOUNDS XX *Ke*R*: 
BEGIN 

ONE SOUNC(CNES .0,4)5 

ONF PCINT:s=POINTER: 

ONE SQUND CONETWO6,1,16)3 
TWOPCINTs=PNINTERS 
ONESQUND(ONETWO36,2,654)3 
ONESQUNC(ONET WO 346 .-3,256)3 
FOUR POINT: =POINTERS 

POINTER :=03 


166 





e 
9 
es 
9 


= _ 

+ + ee 

~~ _ = 

~ a + ee 

Uw ie oo e— 

CD ew a ox 4 oe eo 

ae Po =) cd oO + 

2 = = Us as oe 

“ry = _) On «x 

u U) Ug > meet sya — 

Ud: tw ~ —' CG qa) + 

Jz a wt a en 2c = 

=) Qa Ni >= => > a 

WZ Ym OQ tee 2] WwW 

= a —, ( e eo HD 

Oo Oo <j WO Usb = 

t~ — u'é ww a Co (S = 

Nw ao) Ww oC a Zz 

TWO m <t ww ws AO - 

ak eo © Oo we YW O 

te Y = © a. fy = UN 

Vid t- = =» =&€ OD WW 

Oz) wi kek Oe) AEN |) oS 

—r ge ww AL eet CC) tt: WO 

UM eG = Yu Meet “i 
eet Ce ait O Uj ed = aN LL 
Cr ech Li ee iT <1 ws iiies UL’ 
ite. Fee eez il It-uU A 
MO Weta tT oem om oe thea u _ 
2 UU! eo mt LI Rt CY 00 = oe hm oe tui 


SOK WOME HH OAg ene C= tH 

=O ee O Zee M ~ +t O“.0 @ eees 

0 DWH ¢D—>D Wom #+Wwee 

+2 MN0Ot 2OD UOResz=eMYNaACita 
eesti eS ot © &©FZ AK Det awe oot 
—zTUIDWe WEI - Ak-ZODK D—a@ 
tr OZOUZOGCF NE & e2eZce 
eZ ected eet eZ oe ee D 
NHaALTAOWDAO OCUOUUOWwws 
ONSYVIAMNUA NAATAQDOUODLO e« 
Ui! OW OD ODOWALDVHMTOA0 NK 
weer Fr OWI | Ow ww BMG 
OOMNnNYUS SSN OUOCOF~~m 
ZvNadeymrmmMmnstieczaeZzioow 
ee LI toe CIO ee Dee DDD OO NQeanins 
Owrr RWS RSeSBWOUWCOCUNnLWW 
NUNN NOE FEF UMUNMNUOWUIW ee 
Wait CO eww UI UANAING 
ZORRO ZZ ZaA ZAI StS 
OWNWULU HOOCKUVONdTdAAWHLUUW 


e 
9 


ANALYSIS 


RAW_ 


or gm 


0 
) 
D 


WU! 


167 





Tr 7 a 


C AMMEN T 8 35 Ke Me oe oe ei Be te ee ei te ee ee te ae ac ek he is fe ote te ae ee te ee ce he ot ie he te ae ok 
PROCEDURE COMPUTE _ANY TS EXECUTED ONCE FOR EACH APRAY 
CONTAINING FREQUENCIES PER LANGUAGE. THE CAST ROW OF EACH 
DATA ARRAY CONTAINS THE TOTAL NUMBER OF WORDS WHICH HAVE 
FREQUENCY DATA STORED IN THE CORRESPINCING COLUMN, THIS 
TATAL IS DIVIDED INTO SACH FNTPY IN THE COLUMN GIVING 

A PERCENTAGE OF OQCCURZANFE FCR THAT SCUND COMB INATICN 

WITHIN THE GIVEN PARATITION. 

THE CALLING ARGUEMENTS APE: 

Lo INTEGER ITERATION: THE SUSSCRIPT OF THE LAST ROW IN 
THE ARRAY WHICH HAS FREQUENCY INFORMATION IN IT. IN 

GENERAL IT ROW DIMENSION OF THE ARPAY BETNG PASSED IN LESS 1 
2) RAW NUMBERS: THE ARRAY BEING NORMALIZED. 

eRe Mies He ok He ee wee aie oe ate ie Ae Sieh He ee We eK ee eo ee Oe oe ee ee ee eK ke ee ee KS 
PROCEDURE COMPUTE_ANY( INTEGER VALUE ITERATIONSRFEAL ARRAY 

RAW _NUMPERS(*% ,*))3 

BEGTN 

INTEGER Ts 

T := ITERATIGN#L: 

IF( RAW _NUMBERS(T NUMBER )A=9) THEN 

BEGIN 

Bee « -= 0 UNTIL ITERATION OO 

RAW_NUMBERS(K -NUMBEQ) 2 =RAW_ NUMBERS(K NUMBER) /QR AW NUMBERS 
(T»NUMBER) 

END s 

Sm CREPPUTE _ Any; 
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MAIN Th eae ae ae ete ome ae ee cee te ae hee ee he he at ate ic Se Nr Se ate te HEE ae HED ee REISE he te ok oe Oe oe 
PROCEDURE FINDSQUND IS CALLED FRRM "HECTRHE" AND RETURNS A 
VALUE OF "OQOINTES" CORRESPONDING TQ THE SUBSCRIPT NF THE 
SOUND IM THE ARRAY "CLASS." BECAUSE ALL THE SOUNN 
COMBINATIONS ARE PQWERS OF FOUR. IT IS A SIMPLE MATTER TO 
COMPUTE SAY A TWN-SCUND COMBINATION BY REMEMBERING WHAT THE 
FIRST SQUNOD WAS, MULTIOLYING TH= FIRST SOUND SUBSCRIPT BY 4 
AND ADOING TRE SECOAND SOUND SUBSCRIPT. THE BASIC [MEA NF 
MODULO G@ ARITHMFTIC. NA MATTER WHAT SOUND COMBINATION {S 
THE OBJECTIVE OF THE SEARCH, THE PROGRAM ONLY SEARTHES A 
mee «6©6GF hU6UFOUR SCLEMENTS. MWS" TELL "EF INDSOUND"™ WHICH SOUND 

IN THE wCRD TO LOOK AT. 

ke se ate oe te oi a ke ea see ae Be ote oa a ok te ak Pe ae oe se eke oe oe ok ok ok ea ook kek & He ek de kkk KKK 
PRCCENURE FINDSOUND( INTEGER VALUE WS)3 

BEGIN 

POINTER: =; 

WHILE(CQDE(WSILIA=CLASS(PONINTER)) O09 

POINTER: =POINTER+41 3 

END FINDSOUND3; 

COMMEND Tok oes sect oe tect ee Ae ee ee a tet OK oh oe oR ee oe oe te ak ote Kote cok oe oh oe ee oe 
PROCEDURE ONESOQUND TS SCALLED FROM RAW ANALYSTS. he 

NPEQATES ON THE SAME ORTNCTIPLE AS “FINDSSUND" EXCEPT THAT [tT 
ALSO INCREMENTS THE FREQUENCY COUNT [TM THE ADORNPRTATHE ARRAY 
NOUTPUT' AND INCREMENTS THE TOTAL WORD CCUNTER WITHIN 
WOYUTEUT." THE RAW OF "OIYTPUT® WHICH CONTAINS TRE WORD 
COUNTER MUST RE SOFTSTFIFEN BY THE ARGUENENT "LENGTH," 
See oe eee ee ee eX whe de ote ek et de ke se ot x= He EE RE AS ie Ae ee ie A a oe Ae he A ee ee ee ie oie eg 
PRAC EDURE ONE SOUND (REAL AQRAY CUTPUT(*, *)sINTEGEP VALUES 
BEGIN 

INTEGFR PINS 

PIN <= POINTERS: POINTER :=0;3 
WHILE(CODECIWS I 1LIA=CLASS( POINTER) ) NO 

POINTERS =POINTER 413 

POINTER 2 =PIN=44+PO0OINTEQ 35 

OUT PUT ( POINTER, NUMBER) 2? =OUT PUT (POINTER, NUMRER) +] 

OUTPUT (LENGTH,NUMB8F2) 2 =QUTPUTCLENGTH , NUMBER) + es 

OUTPUT (POINTER ,O):=1 

END ONESCUND; 
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COM MEN T 2 ee Bete oe oe ech ee See ee a Se ee eat ee ee oe oe i ee a oo oe ate ek ae oe oe ote oe oe ek 
PROCEDURE SINGLE_PRI3S CAMPUTES THE ACTUAL PROSABILITY AF A 
LANGUAGE GIVEN ANY ears SQUND WHICH THE PROGRAM LOOKS AT 
AS A SEPARATE ENTITY. [eewe (He Fises! SOUND OF EACH WHE) 
aw THE LAST ee. ne WOR OS SIX SCUND AND GREATER IN LENGTH. 
ae ee eS ADE THAT ALL LANGUAGES AR= EQUALLY 
el. 
PROBABLE: eusenane see Ke Me ee eek ok ae ee She Sen ae ek esos oe ee See he Ee ee xe Se oe ee oe oe =’ 
PROCECURE SINGLE_PRIBS3 
REGIN 
REAL 8,C DEF ,GeH3 
mee fF s= 0 UNTIL 32 DD 
REGIN 
Be=— 2=03=2°7=F £=6 2=H: =9% 
mee 2 c= | UNTIL NUMBER ON 
BEGIN 
B -= S4QNEL( I,J)? 
Mee Ct CHEZ(Y.J): 
mee— 0+ CNE3(1I,J):° 
BE = E+ ONESGTI. J): 
mee = Fe ONESTI, J): 
G = Gt ONFE(I,.J)3 
H2=HHE 6( I,J) 
END; 
FOR K <= 1 UNTIL NUMBFR AC 
BEGIN 
TF(Ba=0) THEN 
ONFICI «K)2=QNEL(II,K)/8 ; 
IF(C3=0) THEN 
ONE2Z (1 sK)S=ONE2Z(T,KI/C : 
eee ==0) THEN 
eee cues tts roy CS 
I=(En=0) TH 
ONE GUT OK) S2ONEG( TS KI/E ; 
IF (FA=0) THEN 
Meat) «K):=ONE5CTI,K I/F : 
IF(G62.=0) THEN 
ONS 4(1T KIS =ONF6(IZKI/G 3 
IF (H-=9) THEN 
BrmeigekK y2 =E6(1,K)/4H3 
END $ 
END ; 
eee SINGLE _ PRORS >: 


ie 


COMMENT xe seo 2 hea ic he Ak te ke ee oe ae ate oe oe de ac a oh ote te ake oe te ote ode fe oh ak eae oe te oe tes oe ote oe ate ak oe ok OK otek 
PROCEDURE GRAPH P2NVIDES A CONVENTENT GRAPHICAL 
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